2 research outputs found

Investigating and Testing Performance Issues in Deep Learning Frameworks

Author: Makkouk Tarek
Publication date: 23/06/2023

Machine Learning (ML) and Deep Learning (DL) applications are becoming more popular due to the availability of DL frameworks such as PyTorch, Keras, and TensorFlow. The quality of DL frameworks is therefore essential to ensuring the quality of DL/ML applications. Given the computationally expensive nature of DL tasks (e.g., training), performance is a critical aspect of DL frameworks. However, optimizing DL frameworks may pose unique challenges due to the peculiarities of DL (e.g., hardware integration and the nature of the computation). In this thesis, we first aim to better understand performance bugs in DL frameworks by conducting an empirical study. We conduct our study on PyTorch and TensorFlow by mining and studying their performance and non-performance bug reports from their respective GitHub repositories. We find that 1) the proportion of newly reported performance bugs increases faster than that of fixed performance bugs, and the ratio of performance bugs among all bugs increases over time; 2) performance bugs take more time to fix, have larger fix sizes, and attract more community engagement (e.g., discussion) than non-performance bugs; and 3) the root causes of performance bugs in DL frameworks fall into a taxonomy of 12 categories and 19 sub-categories, which we manually derived by studying all performance bug fixes. We then investigate the potential of differential testing as a viable technique to detect and prevent performance bugs in DL frameworks. To do so, we train and evaluate two state-of-the-art CNN and RNN architectures (i.e., the LeNet-5 architecture on the MNIST dataset and the LSTM architecture on the IMDB movie review dataset), using different DL frameworks (i.e., PyTorch, Keras, and TensorFlow) and different configurations (i.e., the training dataset sample size, the batch size, the number of epochs, the weight initialization technique, the data type, the hardware used, the learning rate, and the dropout rate).
To assess the performance of the DL models, we use a variety of performance metrics (i.e., training/inference time, hardware (CPU or GPU) usage during training/inference, and memory (RAM or GPU VRAM) usage during training/inference). Then, we compare the performance of the DL models across the DL frameworks. We train and evaluate 21,870 LeNet-5 models and 21,870 LSTM models across the DL frameworks, for a grand total of 43,740 models. Our experiments took over 42 days. We find that 1) differences in performance between different DL frameworks, for the same task, may be indicative of a performance optimization opportunity or a performance bug; and 2) our approach is viable when training and evaluating a smaller number of DL models, which makes it more accessible for developers. Finally, we present some potential avenues for future work that aim to further study performance bugs in DL frameworks.

Concordia University Research Repository

Targeting natural killer cells in cancer immunotherapy

Author: A Bishara
A Boltz
A Curti
A Fuchs
A Iannello
A Lundqvist
A Makkouk
A Marçais
A Pérez-Martínez
A Romanski
A Stojanovic
A Wiernik
A Young
AA Maghazachi
AE Zamora
AM Lesokhin
B Besse
BJ Schmiedel
BS Jones
BY Huang
C Borg
C Guillerey
C Kellner
C Sahm
C Zhang
CA Gerdes
Camille Guillerey
CJ Chan
CM Sungur
D Mittal
DA Knorr
DC Delgado
DH Raulet
DK Sojka
DM Benson Jr.
E Moga
E Vivier
EM Putz
EP von Strandmann
F Ghiringhelli
F Hartmann
F Romagné
FE Davies
G Bernardini
G Sconocchia
H Klingemann
H Spits
HE Kohrt
I Voskoboinik
IP da Silva
J Han
J Nowak
J Rodríguez
J Rueff
JA Bowles
JE Bakema
JE Rubnitz
JR Westin
JS Miller
JW Leong
K Schönfeld
KC Conlon
L Jardine
L Martinet
L Ruggeri
L Wieten
L Wu
L Xu
LE Rossi
LF Porrata
LJ Burns
LL Lanier
LM Weiner
LS Shahied
M Imamura
M Paolino
M Semeraro
M Vitale
MA Stacey
Mark J Smyth
MF Sanmamed
MJ Smyth
MK Gleason
MP Roberti
MR Parkhurst
N Korde
N Marquardt
N Sakamoto
N Stanietsky
N Steele
N Tarek
N Vey
ND Huntington
NF Delahaye
Nicholas D Huntington
NN Shah
PA Beavis
PS Becker
PS Kim
Q Zhou
R Parameswaran
R Romee
RA Clynes
RB Delconte
RK Yang
S Genßler
S Gill
S Krieg
S Mocellin
S Nguyen
S Viaud
S Viel
SJ Blake
SK Grossenbacher
SM Davies
T Baessler
T Tonn
TA Waldmann
V Bachanova
V Groh
VS Cortez
W Deng
W Glienke
WK Weng
WL Gluck
X Wang
Y Hayakawa
Y Li
Z Wang
Publication venue: Springer Science and Business Media LLC

Crossref